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(57) ABSTRACT 

A communication system including at least two functional 
blocks, wherein an first functional block communicates with 
a second functional block by establishing a connection, 
wherein a connection is a logical state in which data may 
pass between the first functional block and the second 
functional block. One embodiment includes a bus coupled to 
each of the functional blocks and configured to carry a 
plurality of signals. The plurality of signals includes a 
connection identifier that indicates a particular connection 
that a data transfer is part of, and a thread identifier that 
indicates a transaction stream that the data transfer is part of. 
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COMMUNICATIONS SYSTEM AND 

METHOD WITH MULTILEVEL 
CONNECTION IDENTIFICATION 

FIELD OF THE INVENTION 

The present invention relates to a communication system 
to couple computing sub-systems. 

BACKGROUND OF THE INVENTION 

Electronic computing and communications systems con- 
tinue to include greater numbers of features and to increase 
in complexity. At the same time, electronic computing and 
communications systems decrease in physical size and cost 
per function. Rapid advances in semiconductor technology 
such as four-layer deep-sub -micron complimentary metal- 
oxide semiconductor (CMOS) technology, have enabled true 
"system-on-a-chip" designs. These complex designs may 
incorporate, for example, one or more processor cores, a 
digital signal processing (DSP) core, several communica- 
tions interfaces, and graphics support in application-specific 
logic. In some systems, one or several of these extremely 
complex chips must communicate with each other and with 
other system components. Significant new challenges arise 
in the integration, verification and testing of such systems 
because efficient communication must take place between 
sub-systems on a single complex chip as well as between 
chips on a system board. One benefit to having an efficient 
and flexible method for communication between sub- 
systems and chips is that system components can be reused 
in other systems with a minimum of redesign. 

One challenge in the integration, verification and testing 
of modern electronic systems stems from the fact that 
modern electronic systems in many application areas have 
functionality, cost and form-factor requirements that man- 
date the sharing of resources, such as memory, among 
multiple functional blocks, where functional blocks can be 
any entity that interfaces to a communication system. In 
such systems, the functional blocks typically possess differ- 
ent performance characteristics and requirements, and the 
communications system and shared resources must simul- 
taneously satisfy the total requirements. Key requirements 
of typical functional blocks are bandwidth and latency 
constraints that can vary over several orders of magnitude 
between functional blocks. In order to simultaneously sat- 
isfy constraints that vary so widely, communications sys- 
tems must provide high degrees of predictability. 

Traditional approaches to the design of communications 
systems for modern, complex computer systems have vari- 
ous strengths and weaknesses. An essential aspect of such 
approaches is the communications interface that various 
sub-systems present to one another. One approach is to 
define customized point-to-point interfaces between a sub- 
system and each peer with which it must communicate. This 
customized approach offers protocol simplicity, guaranteed 
performance, and isolation from dependencies on unrelated 
sub-systems. Customized interfaces, however, are by their 
nature inflexible. The addition of a new sub-system with a 
different interface requires design rework. 

A second approach is to define a system using standard- 
ized interfaces. Many standardized interfaces are based on 
pre-established computer bus protocols. The use of com- 
puter buses allows flexibility in system design, since as 
many different functional blocks may be connected together 
as required by the system, as long as the bus has sufficient 
performance. It is also necessary to allocate access to the bus 
among various sub-systems. In the case of computer buses, 
resource allocation is typically referred to as arbitration. 
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One disadvantage of computer buses is that each sub- 
system or component connected to the bus is constrained to 
use the protocol of the bus. In some cases, this limits the 
performance of the sub-system. For example, a sub-system 

5 may be capable of handling multiple transaction streams 
simultaneously, but the bus protocol is not capable of fully 
supporting concurrent operations. In the case of a sub- 
system handling multiple transaction streams where each 
transaction stream has ordering constraints, it is necessary 

10 for the sub-system to identify each increment of data 
received or transmitted with a certain part of a certain data 
stream to distinguish between streams and to preserve order 
within a stream. This includes identifying a sub-system that 
is a source of a data transmission. Conventionally, such 

15 identification is limited to a non-configurable hardware 
identifier that is generated by a particular sub-system or 
component. 

Current bus systems provide limited capability to preserve 
order in one transaction stream by supporting "split trans- 

20 actions" in which data from one transaction may be inter- 
leaved with data from another transaction in the same 
stream. In such a bus, data is tagged as belonging to one 
stream of data, so that it can be identified even if it arrives 
out of order. This requires the receiving sub-system to 

25 decode an arriving address to extract the identification 
information. 

Current bus systems do not support true concurrency of 
operations for a sub-system that can process multiple 
streams of transactions over a single interconnect, such as a 

3D 

memory controller that handles access to a single dynamic 
random access memory (DRAM) for several clients of the 
DRAM. A DRAM controller may require information 
related to a source of an access request, a priority of an 
access request, ordering requirements, etc. Current commu- 

35 nication systems do not provide for such information to be 
. transmitted with data without placing an additional burden 
on the sub -system to adapt to the existing protocol. 

In order for many sub-systems to operate in conventional 

40 systems using all of their capabilities, additional knowledge 
must be designed into the sub -systems to provide commu- 
nication over existing communication systems. This makes 
sub -systems more expensive and less flexible in the event 
the sub-system is later required to communicate with new 

45 sub-systems or components. Existing communication 
approaches thus do not meet the requirements of today's 
large, complex electronics systems. Therefore, it is desirable 
for a communications system and mechanism to allow 
sub-systems of a large, complex electronics system to inter- 

5Q operate efficiently regardless of their varying performance 
characteristics and requirements. 

SUMMARY OF THE INVENTION 

One embodiment of the present invention includes a 
55 shared communications bus for providing flexible commu- 
nication capability between electronic sub-systems. One 
embodiment includes a protocol that allows for identifica- 
tion of data transmissions at different levels of detail as 
required by a particular sub-system without additional 
60 knowledge being designed into the sub-system. 

One embodiment of the invention includes several func- 
tional blocks, including at least one initiator functional block 
and one target functional block. Some initiator functional 
blocks may also function as target functional blocks. In one 
65 embodiment, the initiator functional block is coupled to an 
initiator interface module and the target functional block is 
coupled to a target interface module. The initiator functional 
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block and the target functional block communicate to their semiconductor devices to communicate to each other 

respective interface modules and the interface modules through a shared off-chip communications resource, such as 

communicate with each other. The initiator functional block a bus. 

communicates with the target functional block by establish- i n 0 ne embodiment, the present invention is a pipelined 

ing a connection, wherein a connection is a logical state in 5 communications bus with separate command, address, and 

which data may pass between the initiator functional block data wires. Alternative embodiments include a pipelined 

and the target functional block. communications bus with multiplexed address, data, and 

One embodiment also includes a bus configured to carry control signals. The former embodiment offers higher per- 

multiple signals, wherein the signals include a connection formance and simpler control than the latter embodiment at 

identifier signal that indicates a particular connection that a 30 the expense of extra wires. The former embodiment may be 

data transfer between an initiator functional block and a' more appropriate for on-chip communications, where wires 

target functional block is part of. The connection identifier are relatively less expensive and performance requirements 

includes information about the connection, such as which are usually higher. The latter embodiment offers higher 

functional block is the source of a transmission, a priority of per-wire transfer efficiency, because it shares the same wires 

a transfer request, and transfer ordering information. One 35 among address and data transfers. The latter embodiment 

embodiment also includes a thread identifier, which provides may be more appropriate for chip-to-chip communications • 

a subset of the information provided by the connection between semiconductor devices, because package pins and 

identifier. In one embodiment, the thread identifier is an board traces increase the per signal cost, while total required 

identifier of local scope that identifies transfers between an communications performance is usually lower, 

interface module and a connected functional block, where in 20 RQ j ^ a block diagram of a complex electronics system 

some embodiments, an interface module connects a func- ioq. Shared communications bus 112 connects sub-systems 

tional block to a shared communications bus. K)2, 104, 106, 108, and 110. Sub-systems are typically 

The connection identifier is a an identifier of global scope functional blocks including a interface module for interfac- 
that transfers information between interface modules or ing to a shared bus. Sub-systems may themselves include 
between functional blocks through their interface modules. 25 one or more functional blocks and may or may not include 
Some functional blocks may require all the information an integrated or physically separate interface module. In one 
provided by the connection identifier, while other functional embodiment, the sub-systems connected by communica- 
blocks may require only the subset of information provided tions bus 112 are separate integrated circuit chips. Sub- 
by the thread identifier. system 104 is an application specific integrated circuit 

30 (ASIC) which, as is known, is an integrated circuit designed 

BRIEF DESCRIPTION OF THE DRAWINGS t0 perform a particular function. Sub-system 106 is a 

FIG. 1 is a block diagram of one embodiment of a dynamic random access memory (DRAM). Sub-system 108 

complex electronics system according to the present inven- 15 an erasable, programmable, read only memory (EPROM). 

t j on Sub-system 110 is a field programmable gate array (FPGA). 

rT „ u j- . r • * * ji 35 Sub-system 102 is a fully custom integrated circuit designed 

FIG. 2 is an embodiment of an interf ace module. „ . t : 1A „ . & t 

specifically to operate in system 100. Other embodiments 

FIG. 3 is an embodiment of an interface module. may conta i n additional sub-systems of the same types as 

FIG. 4 is an embodiment of a communications bus. shown, or other types not shown. Other embodiments may 

FIG. 5 is a timing diagram showing pipelined write al so include fewer sub-systems than the sub-systems shown 

transfers 40 in system 100. Integrated circuit 102 includes sub-systems 

FIG. 6 is a timing diagram showing rejection of a first 102A > l02B > 102C > 102D and l02E ASIC 104 includes 

pipelined write transfer and a successful second write trans- functional blocks 101A, 104B and 104C. FPGA 110 

j er includes functional blocks 110 A and HOB. A functional 

™~ - . l • • i ■ r block may be a particular block of logic that performs a 

FIG. 7 is a tuning diagram showing interleaving of 45 i r *■ a c i li i i l 

j . . 4 & r particular function. A functional block may also be a 

pipelined read and write transfers. , . , . . . . ' 

y * memory component on an integrated circuit. 

FIG. 8 is a timing diagram showing interleaved connec- c , lftA . , c t . , - 

. • i ♦ . System 100 is an example of a system that may consist of 

tions to a single target. ' . 4 * j • < . A c , . , , 

& & one or more integrated circuits or chips. A functional block 

FIG. 9 is a timing diagram showing interleaved connec- may be a logic block on an integrated circuit such as, for 

tions from a single initiator. example, functional block 102E, or a functional block may 

FIG. 10 is a block diagram of one embodiment of part of also be an integrated circuit such as fully custom integrated 

a computer system. circuit 102 that implements a single logic function. 

FIG. 11 is one embodiment of a communications bus. Shared communications bus 112 provides a shared com- 

FIG. 12 is a block diagram of one embodiment of part of 55 munications bus between sub-systems of system 100, 

a computer system. Shared communication bus 114 provides a shared commu- 
nications bus between sub-systems or functional blocks on 

DETAILED DESCRIPTION a single integrated circuit. Some of the functional blocks 

The present invention is a communications system and shown are connected to interface modules through which 
method for allowing multiple functional blocks or sub- 60 thev send and receive signals to and from shared commu- 
systems of a complex electronics system to communicate nications bus 112 or shared communications bus 114, Inter- 
with each other through a shared communications resource, connect 115 is a local point-to-point interconnect for con- 
such as a shared communications bus. In one embodiment, necting interface modules to functional blocks, 
a communications protocol allows multiple functional block Interface modules 120-127 are connected to various 
on a single semiconductor device to communicate to each 65 functional blocks as shown. In this embodiment, interface 
other. In another embodiment, the communications protocol modules 120, 122, 123 and 124 are physically separated 
may be used to allow multiple functional blocks on different from their connected functional block (A, B, C, E and F, 
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respectively). Interface modules 121, and 125-128 are 
essentially part of their respective functional blocks or 
sub-systems. Some functional blocks, such as 102D, do not 
require a dedicated interface module. The arrangement of 
sub-systems, functional blocks and interface modules is 
flexible and is determined by the system designer. 

In one embodiment there are four fundamental types of 
functional blocks. The four fundamental types are initiator, 
target, bridge, and snooping blocks. A typical target is a 
memory device, a typical initiator is a central processing unit 
(CPU). A typical bridge might connect shared communica- 
tions buses 112 and 114. Functional blocks all communicate 
with one another via shared communications bus 112 or 
shared communications bus 114 and the protocol of one 
embodiment. Initiator and target functional blocks may 
communicate a shared communications bus through inter- 
face modules. An initiator functional block may communi- 
cate with a shared communications bus through an initiator 
interface module and a target functional block may com- 
municate with a shared communications bus through a target 
interface module. 

An initiator interface module issues and receives read and 
write requests to and from functional blocks other than the 
one with which it is associated. In one embodiment, an 
initiator interface module is typically connected to a CPU, a 
digital signal processing (DSP) core, or a direct memory 
access (DMA) engine. 

FIG. 2 is a block diagram of an embodiment of an initiator 
interface; module 800. Initiator interface module 800 
includes clock generator 802, data flow block 806, arbitrator 
block 804, address/command decode block 808, configura- 
tion registers 810, and synchronizer 812. Initiator interface 
module 800 is connected to a shared communications bus 
814 and to an initiator functional block 816. In one 
embodiment, shared communications bus 814 is a shared 
communications bus that connects sub-systems, as bus 112 
does in FIG. 1. 

Clock generator 802 is used to perform clock division 
when initiator functional block 816 runs synchronously with 
respect to shared communications bus 814 but at a different 
frequencies. When initiator functional block 816 runs asyn- 
chronously with respect to communications bus 814, clock 
generator 802 is not used, but synchronizer 812 is used. 

Arbitrator block 804 performs arbitration for access to 
shared communications bus 814. In one embodiment, a 
multi-level arbitration scheme is used wherein arbitrator 
module 804 includes logic circuits that manage pre-allocated 
bandwidth aspects of first level arbitration and also logic that 
manages second level arbitration. Data flow block 806 
includes data flow first-in first-out (FIFO) buffers between 
shared communications bus 814 and initiator functional 
block 816, in addition to control logic associated with 
managing a transaction between shared communications bus 
814 and initiator functional block 816. The FIFO buffers 
stage both the address and data bits transferred between 
shared communications bus 814 and initiator functional 
block 816. In one embodiment, shared communications bus 
814 implements a memory mapped protocol. Specific details 
of an underlying computer bus protocol are not significant to 
the invention, provided that the underlying computer bus 
protocol supports some operation concurrency. A preferred 
embodiment of a bus protocol for use with the present 
invention is one that supports retry transactions or split 
transactions, because these protocols provide a mechanism 
to deliver operation concurrency by interrupting a multi- 
cycle transaction to allow transfers belonging to other unre- 
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lated transactions to take place. These protocols allow for 
higher transfer efficiencies because independent transactions 
may use the bus while an initiator waits for a long latency 
target to return data that has been previously requested by an 
5 initiator. 

Address/command decode block 808 decodes an address 
on shared communications bus 814 to determine if a write is 
to be performed to registers associated with initiator func- 
tional block 816. Address/command decode block 808 also 

10 decodes incoming commands. Configuration registers 810 
store bits that determine the state of module 800, including 
bandwidth allocation and client address base. One register 
810 stores an identification (ID) which is a set of bits 
uniquely identifying initiator functional block 816. 

35 FIG. 3 is a block diagram of an embodiment of a target 
interface module 900. Target interface module 900 is con- 
nected to shared communications bus 914 and to target 
functional block 918. Target interface module 900 includes 
clock generator 902, data flow block 906, address/command 

20 decode block 908, synchronizer 912, and state registers in 
state control block 916. Blocks of target interface module 
900 that are named similarly to blocks of initiator module 
800 function in substantially the same way as explained with 
respect to initiator block 800. State registers and state 

25 control block 916 include registers that store, for example, 
client address base and an identifier for target functional 
block 918. 

In one embodiment, an initiator functional block such as 

30 initiator functional block 816 may also act as a target 
functional block in that it has the capability to respond to 
signals from other functional blocks or sub-systems as well 
as to initiate actions by sending signals to other functional 
blocks or sub-systems, 

35 FIG. 4 is a block diagram of a part of a computer system 
1000 according to one embodiment. FIG. 4 is useful in 
illustrating multilevel connection identification. System 
1000 includes initiator functional block 1002, which is 
connected to initiator interface module 1004 by interconnect 

40 1010. Initiator interface module 1004 is connected to target 
interface module 1006 by shared communications bus 1012. 
Target interface module 1006 is connected to target func- 
tional block 1008 by an interconnect 1010. Typically, shared 
communications bus 1012 is analogous to shared commu- 

45 nications bus 112 of FIG. 1 or to shared communications bus 
114 of FIG. 1. Interconnects 1010 are typically analogous to 
interconnect 115 of FIG. 1 in that they connect functional 
blocks to interface modules and are point-to-point, rather 
than shared, interconnects. Interconnects 1010 are typically 

50 physically shorter than shared communications bus 1012 
because of their local nature. As will be explained more fully 
below, system 1000 uses two different levels of connection 
identification depending upon the requirements of a particu- 
lar functional block. "Global" connection identification 

55 information is sent on shared communications bus 1012, 
while "local" connection information, or thread identifica- 
tion information, is sent in interconnects 1010. 

FIG. 5 is a block diagram of one embodiment of a shared 
communications bus 1012. Shared communications bus 

60 1012 is shown connected to entities A, B, C, D and E, which 
may be interface modules, functional blocks, or a combina- 
tion of both. Shared communications bus 1012 is composed 
of a set of wires. Data wires 230 provide direct, high 
efficiency transport of data traffic between functional blocks 

65 on shared communications bus 1012. In one embodiment, 
shared communications bus 1012 supports a bus protocol 
that is a framed, time division multiplexed, fully pipelined, 
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fixed latency communication protocol using separate target is returning requested data on data lines 230. In the 

address, data and connection identification wires. The bus case of a write, a DVA response indicates that the target is 

protocol supports fine grained interleaving of transfers to accepting the provided data from data lines 230. A BUSY 

enable high operation concurrency, and uses retry transac- response indicates that the selected target has a resource 

tions to efficiently implement read transactions from target 5 conflict and cannot service the current request. In this case 

devices with long or variable latency. Details of the arbitra- an initiator should reattempt the transfer again later. A 

lion method used to access shared communications bus 1012 RETRY response indicates that the selected target could not 

are not required to understand the present invention. The deliver the requested read data in time, but promises to do 

delay from when an initiator functional block drives the so at a later time. In this case an initiator must reattempt the 

command and address until the target functional block ]0 transfer at a later time. 

drives the response is known as the latency of shared Connection identifier (CONNID) lines 226 carry a multi- 
communications bus 1012. The bus protocol supports arbi- bit signal driven by an initiator bus owner to indicate which 
tration among many initiator functional blocks and target connection the current transfer is part of. A connection is a 
functional blocks for access to the bus. In the embodiment logical state, established by an initiator, in which data may 
shown, arbitration for access to shared communications bus 1S pass between the initiator and an associated target. The 
1012 is performed by an initiator interface module, such as CONNID typically transmits information including the 
module 1004 of FIG. 4, In other embodiments, arbitration is identity of the functional block initiating the transfer and 
performed by functional blocks directly, or by a combination ordering information regarding an order in which the trans- 
of. interface modules and functional blocks. In one fer must be processed. In one embodiment, the information 
embodiment, a bus grant lasts for one pipelined bus cycle. 20 conveyed by the CONNID includes information regarding 
The protocol does not forbid a single functional block from the priority of the transfer with respect to other transfers. In 
becoming a bus owner for consecutive bus cycles, but does one embodiment the CONNID is a eight-bit code. An 
require that the functional block successfully win arbitration initiator interface module sends a unique CONNID along 
on consecutive cycles to earn the right. with an initial address transfer of a connection. Later trans- 
Shared communications bus 1012 includes separate 2 5 fers associated with this connection (for example, data 
address, data, and control wires. Other embodiments may transfers) also provide the CONNID value so both sender 
include multiplexed address, data, and control signals that and receiver (as well as any device monitoring transfers on 
share a wire or wires. Such an embodiment would provide shared communications bus 1012) can unambiguously iden- 
high per- wire transfer efficiency because wires are shared tify transfers associated with the connection. One advantage 
among address and data transfers. A non-multiplexed 30 of using a CONNID is that transfers belonging to different 
embodiment of shared communications bus 1012 may be transactions can be interleaved arbitrarily between multiple 
more appropriate for communication between functional devices on a per cycle basis. In one embodiment, shared 
blocks on a single integrated circuit chip because wires are communications bus 1012 implements a fully pipelined 
relatively inexpensive and performance requirements are protocol that requires strict control over transaction ordering 
usually higher on a single integrated circuit chip. 35 in order to guarantee proper system operation. Without the 
Clock line 220 is a global signal wire that provides a time use of a CONNID, ordering constraints within a particular 
reference signal to which all other shared communications transaction may be violated because transfers associated 
bus 1012 signals are synchronized. Reset line 222 is a global with a particular connection are not identified, 
signal wire that forces each connected functional block into Because a first command may be rejected by a BUSY 
a default state from which system configuration may begin. 40 response while a later command is already in flight, it is 
Command line 224 carries a multi-bit signal driven by an essential to provide mechanisms that allow full control over 
initiator bus owner. In various embodiments, the multi-bit which commands complete. If such control is not present, 
command signal may convey various types of information. ambiguous system behavior can result. For instance, if a 
For example, a command signal may indicate a transfer type, single initiator interface module issues a sequence of depen- 
information regarding duration of a connection, and 45 dent read and write commands, a busy response to one of the 
expected initiator and target behavior during the connection. commands could result in later commands returning the 
In one embodiment, the command signal includes one or wrong data. One solution to such problems is to avoid 
more bits indicating the beginning and end of a connection. overlapping dependent commands. This solution, however, 
In one embodiment, for example, one bit may indicate the increases the latency of every dependent command in order 
status of a connection. If the bit is zero, the current transfer 50 to ensure proper results. The present invention uses a 
is the final transfer in the connection. After the receipt of a CONNID signal, in part, to allow overlapping of dependent 
zero connection status bit, the next receipt of a connection commands. Therefore, use of a CONNID improves system 
status bit that is a logic one indicates that the transfer is the performance and efficiency. Another benefit of the CONNID 
first in a newly opened connection. Each subsequently of the present invention is that communication system 
received one connection status bit then indicates that the 55 predictability is enhanced because it allows a shared func- 
connection is still open. tional block to respond to requests based upon quality of 
Supported transfer types in this embodiment include, but service guarantees that may vary between connections. For 
arc not limited to read and write transfers. Address lines 228 example, data requested to operate a computer display 
carry a multi-bit signal driven by an initiator bus owner to cannot tolerate unpredictable delay because delay causes the 
specify the address of the object to be read or written during 60 display to flicker. Therefore, the CONNID may be used to 
the current transfer. Response lines 232 carry a multi-bit prioritize data requests from a display controller so that 
signal driven by a target to indicate the status of the current requests from the display controller to a common resource . 
transfer. Supported responses include, but are not limited to are serviced before other requests. The present invention 
the following responses. A NULL response indicates that the also allows for flexible reconfiguration of the CONNID to 
current transfer is to be aborted, presumably because the 65 retune system performance. 

address does not select any target. A data valid and accepted FIG. 6 is a timing diagram of a pipelined write transaction 

(DVA) response indicates, in the case of a read, that the consisting of two write transfers on shared communications 
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bus 1012. Reference may also be made to FIG. 5. A single that A also reject (using BUSY) any other pipelined transfers 

pipelined bus transfer, as shown in FIG. 6, includes an from the same CONNID (in this case, CONNID 1), since the 

arbitration cycle (not shown), followed by a command/ initiator cannot possibly know about the resource conflict 

address/CONNID (CMD 324/ADDR 328/CONNID 326) until after the REQ-RESP latency has passed. Thus, target A 

cycle (referred to as a request, or REQ cycle), and completed 5 must BUSY the WRITE ADDRE1 that is issued in cycle 3, 

by a DATA 330/RESP 342 cycle (referred to as a response, because it has the same CONNID and was issued before the 

or RESP cycle). In one embodiment, the number of cycles initiator could interpret the BUSY response to the first write 

between a REQ cycle and a RESP cycle is chosen at system transfer, and is thus a pipelined transfer. Furthermore, the 

implementation time based upon the operating frequency second attempt (issued in cycle 4) of the WRITE ADDREO 

and module latencies to optimize system performance. The 10 transfer is allowed to complete because it is not a pipelined 

REQ-RESP latency, in one embodiment, is two cycles and is transfer, even though it overlaps the cycle 3 WRITE 

labeled above the DATA 330 signal line on FIG. 6. ADDRE1 transfer. 

Therefore, a complete transfer time includes four shared Note that target A determines that the cycle 4 write is not 

communications bus 1012 cycles, arbitration, request, delay pipelined with any earlier transfers because of when it 

and response. 15 occurs and which CONNID it presents, and not because of 

Two transfers are shown in FIG. 6. On cycle 1, initiator E either the CMD nor the ADDR values. Step 1 of the 

drives REQ fields 340 to request a WRITE transfer to algorithm guarantees that an initiator will only issue a 

address ADDREO. This process is referred to as issuing the transfer that is the oldest non-issued, non-retired transfer 

transfer request. In one embodiment, a single target is within a given connection. Thus, once the first WRITE 

selected to receive the write data by decoding an external 20 ADDRE0O receives the BUSY response in cycle 3, it is no 

address portion of ADDREO. On cycle 3 (a REQ-RESP lon g er issued, and so it becomes the only CONNIDol 

latency later), initiator E drives write data DATAE0 on the transfer eligible for issue. It is therefore impossible for a 

DATA wires; simultaneously, the selected target A drives properly operating initiator to issue a pipelined transfer in 

RESP wires 342 with the DVA code, indicating that A cvc l e 4 » gi ven that an initial c yde 1 transfer received a 

accepts the write data. By the end of cycle 3, target A has 25 BUSY response and the REQ-RESP latency is two cycles, 

acquired the write data, and initiator E detects that target A 0ne embodiment of the initiator maintains a time-ordered 

was able to accept the write data; and the transfer has thus queue consisting of the desired transfers within a given 

completed successfully. CONNID. Each transfer is marked as non-issued and non- 

Meanwhile (i.e. still in cycle 3), initiator E issues a retired * s th f v «? entercd j nt0 queue. It is further marked 

pipelined WRITE transfer (address ADDRE1) to target A. 30 as W^** ^ the immediately older entry in the queue is 

The write data and target response for this transfer both non-retired and addresses the same target; otherwise, the 

occur on cycle 5, where the transfer completes successfully. new transfer * ***** non-pipelmed. Each time a transfer 

Proper operation of many systems and sub-systems rely on ^ u es lt * m ^ d as lssued c W ^ e " a transfer ^mpletes (i.e 

the proper ordering of related transfers. Thus, proper system when the RESP cycle is finished) the transfer is marked 

operation may require that the cycle 3 WRITE complete 35 ™n-issued. K the transfer completes successfully, it is 

after the cycle 1 WRITE transfer. In FIG. 6, the CONNID mark f ed as retired and ma ? be deleted fiom the queue, f the 

field conveys crucial information about the origin of the transfer doe f ^t complete successfully, it will typically be 

transfer that can be used to enforce proper ordering. A ^-attempted, and thus can go back into arbitration for 

preferred embodiment of ordering restrictions is that the re -^ ue - If L lhe transfer d ° es L not ™ m P le t e successful y, and 

initiator and target collaborate to ensure proper ordering, 40 ^ will not be re-attempted then it should not be marked as 

even during pipelined transfers. This is important, because retired untl1 the next transfer ' lf 11 exists > 15 not marked as 

transfer pipelining reduces the total latency of a set of issued ' ^ r f estnctl0n P revents the imtiator lo S lc fr ° m 

transfers (perhaps a single transaction), thus improving ^suing out of order. As the oldest non-Retired transfer 

system performance (by reducing latency and increasing * sucs > u * marked 88 issued * 71115 aUows the second-oldest 

usable bandwidth) 45 non - reureci transfer to arbitrate to issue until the older 

t ' . , „ , transfer completes (and is thus marked as non-issued), if it 

According to the algorithm of one embodiment: ^ marked as pipelined 

1. An initiator may issue a transfer Y: An embodiment of the target implementation maintains a 

a) if transfer Y is the oldest, non-Issued, non-retired time-ordered queue whose depth matches the REQ-RESP 
transfer among the set of transfer requests it has with 50 latency. The queue operates off of the bus clock, and the 
matching CONNID, or oldest entry in the queue is retired on each bus cycle; 

b) if all of the older non-retired transfers with matching simultaneously, a new entry is added to the queue on each 
CONNID are currently issued to the same target as bus cycle. The CONNID from the current REQ phase is 
transfer Y If issued under this provision, transfer Y cop i et i j nt0 tne new queue enlry . j n addition, if the current 
is considered pipelined with the older non-retired 55 req phase contains a valid transfer that selects the target 
transfers. (via the External Address), then "first" and "busy" fields in 

2. A target that responds to a transfer X in such a way that the new queue entry may be set; otherwise, the first and busy 
the initiator might not retire the transfer must respond bits are cleared. The first bit will be set if the current transfer 
BUSY to all later transfers with the same CONNID as will receive a BUSY response (due to a resource conflict) 
transfer X that are pipelined with X, 60 and no earlier transfer in the queue has the same CONNID 

Note that an older transfer Y that is issued after a newer and has its first bit set. The first bit implies that the current 

transfer X with matching CONNID is not considered pipe- transfer is the first of a set of potentially-pipelined transfers 

lined with X, even if Y Issues before X completes. This that will need to be BUSY'd to enforce ordering. The busy 

situation is illustrated in FIG. 7. If target A has a resource bit is set if either the target has a resource conflict or one of 

conflict that temporarily prevents it from accepting DAIAE0 65 the earlier transfers in the queue has the same CONNID and 

associated with the WRITE ADDREO from cycle 1, then A has the first bit set. This logic enforces the REQ-RESP 

responds BUSY. Step 2 of the foregoing algorithm requires pipeline latency, ensuring that the target accepts no pipelined 
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transfers until the initiator can react to the BUSY response 
to the transfer marked first. 

Application of the algorithm to the initiators and targets in 
the communication system provides the ability to pipeline 
transfers (which increases per-connection bandwidth and 5 
reduces total transaction latency) while maintaining trans- 
action ordering. The algorithm therefore facilitates high 
per-connection performance. The fundamental interleaved 
structure of the pipelined bus allows for high system 
performance, because multiple logical transactions may 10 
overlap one another, thus allowing sustained system band- 
width that exceeds the peak per-connection bandwidths. For 
instance, FIG. 8 demonstrates a system configuration in 
which initiator E needs to transfer data to target Aon every 
other bus cycle, while initiator D requests data from target 15 
B on every other bus cycle. Since the communication system 
supports fine interleaving (per bus cycle), the transactions 
arc composed of individual transfers that issue at the natural 
data rate of the functional blocks; this reduces buffering 
requirements in the functional blocks, and thus reduces 20 
system cost. The total system bandwidth in this example is 
twice the peak bandwidth of any of the functional blocks, 
and thus high system performance is realized. 

The present invention adds additional system-level 
improvements in the area of efficiency and predictability. 25 
First, the connection identifier allows the target to be selec- 
tive in which requests it must reject to preserve in-order 
operation. The system only need guarantee ordering among 
transfers with the same CONN1D, so the target must reject 
(using BUSY) only pipelined transfers. This means that the 30 
target may accept transfers presented with other CONNID 
values even while rejecting a particular CONNID. This 
situation is presented in FIG. 9, which adds an interleaved 
read transfer from initiator D to the pipelined write transfer 
of FIG. 7. All four transfers in FIG. 9 select target A, and A 35 
has a resource conflict that prevents successful completion 
of the WRITE ADDREO that issues in cycle 1. While the 
rejection of the first write prevents A from accepting any 
other transfers from CONNID 1 until cycle 4, A may accept 
the unrelated READ ADDRDO request of cycle 2 if A has 40 
sufficient resources. Thus, overall system efficiency is 
increased, since fewer bus cycles are wasted (as would be 
the case if target A could not distinguish between 
connections). 

Second, in one embodiment the connection identifier 45 
allows the target to choose which requests it rejects. The 
target may associate meanings such as transfer priority to the 
CONNID values, and therefore decide which requests to act 
upon based upon a combination of the CONNID value and 
the internal state of the target. For instance, a target might 50 
have separate queues for storing transfer requests of differ- 
ent priorities. Referring to FIG. 9, the target might have a 
queue for low priority requests (which present with an odd 
CONNID) and a queue for high priority requests (which 
present with an even CONNID). Thus, the CONNID 1 55 
WRITE ADDREO request of cycle 1 would be rejected if 
the low-priority queue were full, whereas the CONNID 2 
READ ADDRDO transfer could be completed successfully 
based upon available high-priority queue resources. Such 
differences in transfer priorities are very common in highly- 60 
integrated electronic systems, and the ability for the target to 
deliver higher quality of service to higher priority transfer 
requests adds significantly to the overall predictability of the 
system. 

As FIG. 9 implies, the algorithm described above allows 65 
a target to actively satisfy transfer requests from multiple 
CONNID values at the same time. Thus, there may be 



multiple logical transactions in flight to and/or from the 
same target, provided that they have separate CONNID 
values. Thus, the present invention supports multiple con- 
nections per target functional block. 

Additionally, an initiator may require the ability to present 
multiple transactions to the communications system at the 
same time. Such a capability is very useful for initiator such 
as direct memory access (DMA) devices, which transfer data 
between two targets. In such an application, the DMA 
initiator would present a read transaction using a first 
CONNID to a first target that is the source of the data, and 
furthermore present a write transaction using a second 
CONNID to a second target that is the data destination. At 
the transfer level, the read and write transfers could be 
interleaved. This reduces the amount of data storage in the 
DMA initiator, thus reducing system cost. Such an arrange- 
ment is shown in FIG. 10, where initiator E interleaves 
pipelined read transfers from target A with pipelined write 
transfers to target B. Thus, the present invention supports 
multiple connections per initiator functional block. 

The control structures required to support implementation 
of the present invention, as described above with respect to 
the algorithm, are simple and require much less area than the 
data buffering area associated with traditional protocols that 
do not provide efficient fine interleaving of transfers. Thus, 
the present invention minimizes communication system area 
and complexity, while delivering high performance and 
flexibility. 

Finally, the CONNID values that are associated with 
particular initiator transactions should typically be chosen to 
provide useful information such as transfer priorities but 
also to minimize implementation cost. It is useful to choose 
the specific CONNID values at system design time, so the 
values can be guaranteed to be unique and can be ordered to 
simplify comparison and other operations. Furthermore, it is 
frequently useful to be able to change the CONNID values 
during operation of the communications system so as to alter 
the performance and predictability aspects of the system. 
Preferred implementations of the present invention enable 
flexible system configuration by storing the CONNID values 
in ROM or RAM resources of the functional blocks, so they 
may be readily re-configured at either system build time or 
system run time. 

FIG. 11 shows an interconnect 1010, which is a point-to- 
point interconnect as shown in FIG. 4. Interconnect 1010 
includes additional signals as compared to the protocol 
described with reference to FIG. 5. As will be explained 
below, some of the additional signals are particularly useful 
as signals sent over point-to-point interconnects such as 
interconnects 1010. The protocol of interconnect 1010 con- 
trols point-to-point transfers between a master entity 1102 
and a slave entity 1104 over a dedicated (non-shared) 
interconnect. Referring to FIG. 4, a master entity may be, for 
example, initiator functional block 1002 or target interface 
module 1006. A slave entity may be, for example, initiator 
interface module 1004 or target functional block 1008. 

Signals shown in FIG. 11 are labeled with signal names. 
In addition, some signal names are followed by a notation or 
notations in parentheses or brackets. The notations are as 
follows: 

(I) The signal is optional and is independently config- 
urable 

(A) The signal must be configured together with signals 

having similar notations 
(Al) The signal is independently configurable if (A) 

interface modules exist 
[#] Maximum signal width 
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The clock signal is the clock of a connected functional Request Connection Identifier (ReqConnID) provides the 

block. The command (Cmd) signal indicates the type of CONNID associated with the current transaction intended 

transfer on the interconnect. Commands can be issued for the slave. CONNIDs provide a mechanism by which a 

independent of data. The address (Addr) signal is typically system entity may associate particular transactions with the 

an indication of a particular resource that an initiator func- 5 system entity. One use of the CONNID is in establishing 

tional block wishes to access. Request Accept (ReqAccept) request priority among various initiators. Another use is in 

is a handshake signal whereby slave 1104 allows master ass0 ciating actions or data transfers with initiator identity 

1102 to release Cmd, Addr and DataOut from one transfer rather than the address presented with the transac ti on 

and reuse them for another transfer. If slave 1104 is busy and rea uest 

cannot participate in a requested transfer, master 1102 must 7™ " . ♦ r no n a j * a 

. * t r *o j ajj jn 4 n . n , a j » 10 The embodiment of FIG, 11 provides end-to-end connec- 

continue to present Cmd, Addr and DataOut. DataOut is data t . ., 4 . c 4 . - lL nnxrxnn .. ■ 

sent from a master to a slave, typically in a write transfer tl0n identification with CONNID as wel as point-to-point, 

Dataln typically carries read data or ™ rc ^ntificaUon with Thread ID. A Thread ID is 

Response (Resp) and Dataln are signals sent from slave an identifier of local scope that simply identifies transfers 

1104 to master 1102. Resp indicates that a transfer request between the interface module and its connected functional 

that was received by slave 1104 has been serviced. Response 3S block - Io contrast, the CONNID is an identifier of global 

accept (RespAccept) is a handshake signal used to indicate SC0 P e that identifies transfers between two interface modules 

whether the master allows the slave to release Resp and (and, if required, their connected functional blocks). 

Dataln. A Thread ID should be small enough to directly index 

Signals Clock, Cmd, Addr, DataOut, ReqAccept, Resp, tables within the connected interface module and functional 

Dataln, and RespAccept, in one embodiment, make up a 20 block. In contrast, there are usually more CONNIDs in a 

basic set of interface module signals. For some functional system than any one interface module is prepared to simul- 

blocks, the basic set may be adequate for communication taneously accept. Using a CONNID in place of a Thread ID 

purposes. requires expensive matching logic in the interface module to 

In other embodiments, some or all of the remaining associate a returned CONNID with specific requests or 

signals of bus 1012 may be used. In one embodiment, Width 25 buffer entries. 

is a three-bit signal that indicates a width of a transfer and Using a networking analogy, the Thread ID is a level-2 

is useful in a connection that includes transfers of variable (data link layer) concept, whereas the CONNID is more like 

width. Burst is a multibit signal that allow individual com- a level-3 (transport/session layer) concept. Some functional 

mands to be associated within a connection. Burst provides blocks only operate at level-2, so it is undesirable to burden 

an indication of the nature of future transfers, such as how 30 the functional block or its interface module with the expense 

many there will be and any address patterns to be expected. of dealing with level-3 resources. Alternatively, some func- 

Burst has a standard end marker. Some bits of the Burst field tional blocks need the features of level-3 connections, so in 

are reserved for user-defined fields, so that a connection may this case it is practical to pass the CONNID through to the 

be ignorant of some specific protocol details within a functional block. 

connection. 35 Referring to FIG. 4, a CONNID is required to be unique 

Interrupt and error signals are an important part of most when transferred between interface modules 1004 and 1006 

computer systems. Interrupt and error signals generated by on shared communications bus 1012. The CONNID may be 

initiator or target functional blocks are shown, but the sent over a local interconnect, such as interconnect 1010. In 

description of their functionality is dependent upon the many cases, however, it is much more efficient to use only 

nature of a particular functional block and is not important 40 Thread ID between a functional block and its interface 

to understanding the invention. module. For example initiator functional block 1002 may 

Request Thread Identifier (ReqThreadID), in one not require all the information provided by the CONNID. 

embodiment, is a four-bit signal that provides the thread Also, in some systems, multiple identical initiator functional 

number associated with a current transaction intended for blocks 1002 may exist with the same CONNID so that a 

slave 1104. All commands executed with a particular thread 45 particular target functional block 1008 receiving a transfer 

ID must execute in order with respect to one another, but will not know which connection it is actually part of unless 

they may execute out of order with respect to commands logic in initiator interface module 1004 translates the "local" 

from other threads. Response Thread Identifier CONNID to a unique "global" CONNID. The design and 

(RespThreadID) provides a thread number associated with a implementation of such a translation functionality in an 

current response. Because responses in a thread may return 50 interface module is complicated and expensive. In such 

out of order with respect to other threads, RespThreadID is cases, the CONNID may be sent between interface modules 

necessary to identify which thread's command is being over shared communications bus 1012 while the Thread ID 

responded to. In one embodiment, ReqThreadID and is sent between a functional block and an interface module. 

RespThreadID are optional signals, but if one is used, both In the case of an initiator functional block, a one-to-one 

must be used. 55 static correspondence may exist between Thread ID and 

Request Thread Busy (ReqThreadBusy) allows the slave CONNID. For example if the Thread ID is "1", a single 
to indicate to the master that it cannot take any new requests CONNID is mapped for a particular interface module, 
associated with certain threads. In one embodiment, the solving the problem of multiple, identical functional blocks. 
ReqThreadBusy signal is a vector having one signal per In the case of a target functional block, there is a one-to- 
thread, and a signal asserted indicates that the associated 60 one dynamic correspondence between a Thread ID and a 
thread is busy. CONNID. If a target functional block supports two simul- 

Response Thread Busy (RespThreadBusy) allows the taneous threads, the target interface module acquires the 

master to indicate to the slave that it cannot take any CONNID of an open connection and associates it with a 

responses (e.g., on reads) associated with certain threads. thread as needed. For example, a target interface module 

The Resp'tti read Busy signal is a vector having one signal 65 receives a CONNID of "7", and then maps CONNID 7 to 

per thread, and a signal asserted indicates that the associated thread "0". Thereafter, all transfers with CONNID 7 are 

thread is busy, associated with thread 0 until connection 7 is closed. 
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Referring to FIG. 12, an example of a use of Thread ID, 
consider a series of identical direct memory access (DMA) 
engines in a system. In FIG. 12, elements 1202 are identical 
DMA engines, each connected to an initiator interface 
module 1204. Initiator interface modules 1204 are con- 5 
nected to shared communications bus 1212. Target interface 
module 1206 is also connected to shared communications 
bus 1212 and transmits data from bus 1212 to DRAM 
controller 1208, which is a target functional block. Target 
interface module 1206 is connected to DRAM controller no 
1208 by interconnect 1214. DRAM controller 1208 controls 
access to DRAM 1213. 

A DMA a engine is an example of an initiator functional 
block that also functions as a target functional block. When 
the DMA engine is programmed by software, it acts as a is 
target. Thereafter, the DMA engine is an initiator. Because a 
DMA engine performs both read and write operations, two 
connections can be associated with a single DMA engine. If 
some buffering is available in the DMA engine, read and 
write operations may be decoupled so that both types of 20 
operations can be performed concurrently. A read may occur 
from a long latency storage device which requires the read 
data to be buffered on the DMA engine before a write 
operation writes the data. In one embodiment, each of DMA 
engines 1202 uses a Thread ID to identify the read stream 25 
and a different Thread ID to identify the write stream. The 
DMA engine does not require more information, such as 
what other functional block participates in a transaction. 
Therefore, a CONNID is not required to be sent from the 
DMA engine 1202 to a connected interface module 1204. 30 
Mapping of a Thread ID to a CONNID occurs in the 
interface module 1204. 

In one embodiment, each initiator interface module 1204 
maps a unique CONNID to each of two Thread IDs from a 
connected DMA engine 1202. Each of DMA engines 1202 35 
use a single bit, for example, Thread ID of FIG. 11, to 
distinguish between its two threads. For each transfer over 
shared communications bus a unique CONNID is sent to 
target interface module 1206. The CONNID may include 
priority information, for example, assigning high priority to 40 
requests for graphics data. The high priority graphics data 
request is immediately serviced by DRAM controller 1208 
while lower priority request may be required to wait. 

Because intelligence is designed into the interface mod- 
ules and the communications protocols, less intelligence is 45 
required of the functional block such as the DRAM con- 
troller 1208 and the DMA engines 1202. This has the 
advantage of making functional blocks more portable or 
reusable as systems evolve. For example, a DMA engine 
used for a high priority application may be switched with a 50 
DMA engine used for a lower priority application simply by 
changing their respective connected interface modules. 

In one embodiment, target and initiator interface modules 
are programmed at the transistor level so that their precise 
function, including their CONNID assignment, is fixed at 55 
power-up. In another embodiment, the design of interface 
modules is in RAM so that the interface module is a 
reprogrammable resource. In this case, the interface module 
is reprogrammed, including reassignment of CONNIDs, by 
software. 60 

The present invention has been described in terms of 
specific embodiments. For example, embodiments of the 
present invention have been shown as systems of particular 
configurations, including communications buses using par- 
ticular protocols. One of ordinary skill in the art will 65 
recognize that modifications may be made without departing 
from the spirit and scope of the invention as set forth in the 



claims. For example, the present may be used in systems 
employing shared communications structures other than 
buses, such as rings, cross-bars, or meshes. 
What is claimed is: 

1. A communication system comprising: 

at least two functional blocks, wherein an initiator func- 
tional block of the at least two functional blocks sends 
transfer requests to a target functional block of the at 
least two functional blocks, said target functional block 
responding to the transfer requests, by establishing a 
connection, wherein a comection is a logical state in 
which data may pass between the first functional block 
and the second functional block; 

a communication medium configured to carry a plurality 
of signals, wherein the plurality of signals comprises a 
connection identifier that identifies a particular connec- 
tion that a data transfer is part of; 

an initiator interface module coupled to the initiator 
functional block and to the communication medium to 
transfer data between the initiator functional block and 
the communication medium, said initiator interface 
module mapping the connection identifier to a thread 
identifier that indicates a transaction stream the data the 
data transfer is part of, the thread identifier communi- 
cated between the initiator interface module and initia- 
tor functional block; 

a target interface module coupled to the target functional 
block and to the communication medium to transfer 
data between the target functional block and the com- 
munication medium, said target interface module rap- 
ing the connection identifier to a thread identifier that 
indicates a transaction stream the data the data transfer 
is part of, the thread identifier communicated between 
the target interface module and target functional block; 

the connection identifier sent with a transfer request from 
the initiator interface module to the target interface 
module and sent with data transfers between the target 
interface module and the initiator interface module. 

2. The communication of claim 1, further comprising at 
least one bus, each bus coupling an interface module to its 
associated functional block, the bus comprising a plurality 
of signal lines, wherein the thread identifier is communi- 
cated across at least one of the plurality of signal lines. 

3. The communication system of claim 1, wherein the 
thread identifier is sent from the target interface module to 
the target functional block and from the initiator interface 
module to the initiator functional block. 

4. A communication system comprising: 

at least two functional blocks, wherein a first functional 
block communicates with a second functional block by 
establishing a connection, wherein a connection is a 
logical state in which data may pass between the first 
functional block and the second functional block; and 

a communication medium configured to carry a plurality 
of signals between interface modules; 

an initiator functional block configured to send transfer 
requests; 

an initiator interface module coupled to the initiator 
functional block and to the communication medium; 

a target functional block that responds to transfer 
requests; 

a target interface module coupled to the target functional 
block and to the communication medium; 

a connection identifier configured to be sent with a 
transfer request from the initiator interface module to 
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the target interface module, the connection identifier allowing an initiator functional block to issue a first 

comprising a multi-bit value that encodes information transfer "Y" if the transfer "Y" is an oldest, non-issued, 

including a transfer priority, a transfer order, and a non -retired transfer among a set of transfer requests 

functional block that originated the transfer, the con- w i tn a same connection identifier as the transfer "Y"; 

ncction identifier is one of a plurality of connection 5 anc j 

identifiers associated with the initiator functional block , hfi tona i b . ock g i ving a BUSY response to 

and ,s mapped to a thread identifier by the .n.t.ator u traQsfer ft fa . , ined ^ transfer „ x „ 

interface module. A i . 4 .„ . t - MV „ 

5. The communication system of claim 4, wherein the a f nc ! has a ™? connection identifier as the transfer 'X 
connection identifier is one of a plurality of connection 1rt l [ the lar S et f ™* l0nal * lock S ives a bus y res P onse <° 
identifiers associated with a target functional block that 10 the tran f f ^ r X so lhat an initiator ^initiating the 
supports simultaneous connections, and wherein the target transfer"X" may not retire the transfer "X"; 
functional block acquires a connection identifier of an open wherein a transfer "Y" that is issued after a transfer "X" 
connection and maps the connection identifier to a thread is older than the transfer "X", and has a same connec- 
identifier. ^ on identifier as the transfer "X" is considered not 

6. The communication system as set forth in claim 4, 15 pipelined with the transaction "X". 

wherein the thread identifier is configured to be sent from the 10- The method of claim 9, further comprising the step of 

target interface module to the target functional block and allowing the initiator functional block to issue the transfer 

from the initiator interface module to the initiator functional "Y" if every non-retired transfer with the same connection 

block, and the connection identifier is configured to be sent identifier is older Lhan the transfer "Y" and is currently 

from the target interface module to the target functional 20 issued to a same target functional block as the transfer "Y". 

block and from the initiator interface module to the initiator 11 The method of claim 9, wherein if the transfer "Y" is 

functional block. issued, the transfer "Y" is considered pipelined with the 

7. The method as set forth in claim 4, further comprising older > non-retired transfers. 

a thread identifier configured to be communicated across the 12. The method of claim 11, wherein a target functional 

communication medium, the thread identifier indicating that 25 bl° c k determines whether a transfer is a pipelined transfer 

a transaction stream that the data transfer is part of; based upon when the transfer occurs and upon a connection 

wherein the connection identifier is mapped from a thread identifier associated with' the transfer. 

identifier by the initiator interface module. 13 ' ^ method of claim 9, further comprising the steps 

8. A communication system comprising: 3Q °f' 

at least two functional blocks, wherein a first functional an i mtiator functional block maintaining a time-ordered 

block communicates with a second functional block by ? ueue of desired transfers with a same connection 

establishing a connection, wherein a connection is a identifier, 

logical state in which data may pass between the first the initiating functional block marking a transfer as non- 
functional block and the second functional block; and 35 issued and non-retired as it is entered into the queue, 
a communication medium coupled to interface modules 14 ^ method of claim 13, further comprising the steps 
and configured to carry a first plurality of signals °f ; 

between modules; if a next oldest entry is non-retired and addresses a same 
at least one bus, each bus coupling an interface module to tar S et functional block, marking the transfer as pipe- 
its associated functional block, the bus comprising a 40 lined; else 

plurality of signal lines, wherein the plurality of signal marking the transfer as non-pipelined, 

lines comprises a thread identifier (ID) that indicates a 15. The method of claim 14, further comprising the step 

transaction stream that the data transfer is part of, a of > when a transfer issues, marking the transfer as issued, 

request thread ID signal that indicates a thread number l fi - The method of claim 15, further comprising the step 

associated with a current transaction intended for a 45 °U wnen a transfer is completed, marking the transfer as 

target functional block, a response thread ID signal that non-issued. 

indicates a thread that a transfer from the target func- 17- The method of claim 16, further comprising the step 

tional block is part of, a request thread busy signal that of > if the transfer is successfully completed, marking the 

indicates to an initiator functional block that the target transfer as retired;-and deleting the transfer from the queue, 

functional block cannot receive new requests associ- 50 i8 The method of claim 17, further comprising the step 

ated with certain threads, and a response thread busy °f. if the transfer is not successfully completed, 

signal that indicates that the initiator functional block re-attempting the transfer. 

cannot receive any new responses from the target 19. The method of claim 9, further comprising the step of 

functional block that are associated with certain the target functional block maintaining a time-ordered queue 

threads. 55 having a depth that is a number of bus clock cycles between 

9. A method for communicating between a plurality of a request for a transfer and a response to the request, 
functional blocks in a computer system, the method com- 20. The method of claim 19, further comprising the steps 
prising the steps of: °f ; 

establishing a plurality of connection identifiers, wherein on each cycle of the bus clock, retiring an oldest entry in 

each connection identifier associates a particular data 60 the time-ordered queue; and 

transfer with a particular connection, wherein a con- on each cycle of the bus clock, adding a new entry to the 

nection is a logical state* in which data may pass time-ordered queue, including a connection identifier 

between an initiator functional block of a plurality of associated with a current request for a transfer, 

functional blocks and a target functional block of the 21. The method of claim 20, further comprising the steps 

plurality of functional blocks, and wherein a connec- 65 of: 

lion is established when a particular data transfer is if a current request for a transfer contain s a valid transfer 

initiated; that selects the target functional block, allowing a 
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FIRST bit and a BUSY bit of an entry in the time- 
ordered queue to be set, wherein a set FIRST bit implies 
that an associated transfer is a first transfer of a set of 
potentially pipelined transfers; else 
clearing the FIRST bit and the BUSY bit. 

22. The method of claim 21, further comprising the step 
of setting the FIRST bit if: 

no transfer in the time-ordered queue is earlier than a 
current transfer, has a same connection identifier as the 
current transfer and has an associated FIRST bit set; 
and 

the current transfer will receive a BUSY response due to 
a resource conflict. 

23. The method of claim 21, further comprising the step 
of setting the BUSY bit if: 

the target functional block has a resource conflict; or 
an earlier transfer in the time-ordered queue has an 

associated FIRST bit set and has a same connection 

identifier as a current transfer. 
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24. The method of claim 23, further comprising the step 
of using a connection identifier to enforce ordering among 
transfers. 

25. The method of claim 24, further comprising the step 
5 of: 

in response to a first request for a data transfer issued in 
a first bus cycle, the target functional block setting a 
BUSY bit in a first time-ordered queue entry, wherein 
10 a first connection identifier is associated with the first 
request; and 

in response to a second request for a data transfer in a next 
bus cycle subsequent to the first bus cycle, the target 
15 functional block clearing a BUSY bit in a second 
time-ordered queue entry and performing an action in 
connection with executing the data transfer requested in 
the second request. 
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UNITED STATES PATENT AND TRADEMARK OFFICE 

CERTIFICATE OF CORRECTION 



PATENT NO. : 6,182,183 Bl Page 1 of 1 

DATED : January 30, 2001 

INVENTOR(S) : Wingard et al. 



It is certified that error appears in the above-identified patent and that said Letters Patent is 
hereby corrected as shown below: 



Column 13, 

Line 18, delete "Dataln", insert -- Dataln --. 
Column 16, 

Line 11, delete "comection", insert ~ connection --. 
Line 31, delete "raping", insert - mapping --. 



Column 17, 

Line 23, delete "method", insert communication system --. 

Lines 23-24, delete "further comprising a", insert - wherein the --. 

Line 24, insert -- is -- before configured. 

Line 25, delete "that". 

Line 26, delete "the" and insert - a «. 



Column 18, 

Line 19, before "and", delete "-". 



Signed and Sealed this 
Twenty-sixth Day of November, 2002 



Attest: 




JAMES E. ROGAN 

Attesting Officer Director of the United States Patent and Trademark Office 
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